feat: Add AmortizedVIPosterior for amortized variational inference#1751
Conversation
…ing in training; add gradient flow test
…curacy comparison
…r; update references in implementation and tests
- Reuse Zuko flow enum
- Align MAP with potential-based logic
- Tighten sampling/validation behavior while updating tests and docs
- Fix VIPosterior.to() to return self for method chaining (matches AmortizedVIPosterior)
- Add theta dimension validation in AmortizedVIPosterior.train() to catch mismatches early
- Remove ZukoFlowType from top-level sbi.inference exports (still available via sbi.inference.posteriors)
- Update test imports accordingly

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
❌ 15 Tests Failed (9 flaky tests also reported).
manuelgloeckler left a comment:
Hey Jan,
thanks for implementing this. I left a few comments on the tests and `__init__` changes.
I wonder if it's somewhat easy to connect the old VIPosterior parts with the new AmortizedVIPosterior a bit better. But that would require quite a few changes to the VI parts, I think, so I'm also happy to do the amortized VI separately.
Add `build_zuko_vi_flow` function that creates unconditional Zuko normalizing flows for variational inference training. Supports:
- NSF (Neural Spline Flow)
- MAF (Masked Autoregressive Flow)
- Gaussian (full covariance)
- Gaussian diagonal

Also includes helper `_build_zuko_gaussian_flow` with custom affine transforms (diagonal and lower triangular) for Gaussian variants.

This addresses Phase 1, Step 1.1 of the VI unification plan.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
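As a dependency-free sketch of how such a builder can dispatch on the flow type (the enum members and the zuko constructor strings below are illustrative assumptions, not the actual sbi code):

```python
from enum import Enum

class ZukoFlowType(Enum):
    """Hypothetical mirror of the flow-type enum; member names assumed."""
    NSF = "nsf"
    MAF = "maf"
    GAUSSIAN = "gaussian"
    GAUSSIAN_DIAG = "gaussian_diag"

def build_zuko_vi_flow(flow_type: ZukoFlowType, dim: int) -> str:
    """Sketch of the builder's dispatch logic; returns a description string
    instead of a real zuko flow so the example stays dependency-free."""
    builders = {
        ZukoFlowType.NSF: f"zuko NSF flow, features={dim}, unconditional",
        ZukoFlowType.MAF: f"zuko MAF flow, features={dim}, unconditional",
        ZukoFlowType.GAUSSIAN: f"affine flow, lower-triangular scale ({dim}x{dim})",
        ZukoFlowType.GAUSSIAN_DIAG: f"affine flow, diagonal scale ({dim},)",
    }
    try:
        return builders[flow_type]
    except KeyError:
        raise ValueError(f"Unsupported flow type: {flow_type}") from None
```

The Gaussian variants correspond to the custom affine transforms mentioned above: a diagonal scale for `gaussian_diag` and a lower-triangular one for the full-covariance case.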
Update DivergenceOptimizer to handle both Pyro TransformedDistribution and ZukoUnconditionalFlow variational distributions:
- Add VariationalDistribution type alias for Union of flow types
- Detect flow type via isinstance check for ZukoUnconditionalFlow
- Handle Pyro-specific set_default_validate_args conditionally
- Properly register Zuko nn.Module flows in ModuleList

Part of VI unification (Phase 2): adapting existing VI infrastructure to work with new Zuko-based flows alongside legacy Pyro flows.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Phase 3 of VI unification: Extend Zuko flow support to all divergence optimizer subclasses (ElboOptimizer, ForwardKLOptimizer, RenyiDivergenceOptimizer).

Changes:
- Update warmup() to support Zuko flows using sample_and_log_prob
- Add clear error for unsupported 'identity' warmup with Zuko flows
- Fix missing 'raise' in NotImplementedError for invalid warmup methods
- Update _loss() methods to check _is_zuko flag for reparameterized sampling
- Refactor elbo_particles() to handle both Zuko and Pyro flow APIs
- Fix typo: 'inital_target' -> 'initial_target'

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
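The dual-API handling can be illustrated with stand-in classes. `ZukoStyleFlow` and `PyroStyleFlow` below are hypothetical mocks of the two interfaces (one offering `sample_and_log_prob`, the other `rsample` plus `log_prob`), not the real sbi or library classes:

```python
import math
import random

class ZukoStyleFlow:
    """Mock of the Zuko-style API: one call returns samples with log-probs."""
    def sample_and_log_prob(self, n):
        xs = [random.gauss(0.0, 1.0) for _ in range(n)]
        lps = [-0.5 * (x * x + math.log(2 * math.pi)) for x in xs]
        return xs, lps

class PyroStyleFlow:
    """Mock of the Pyro-style API: reparameterized rsample() + log_prob()."""
    def rsample(self, n):
        return [random.gauss(0.0, 1.0) for _ in range(n)]
    def log_prob(self, xs):
        return [-0.5 * (x * x + math.log(2 * math.pi)) for x in xs]

def sample_and_log_prob(q, n):
    """Dispatch on the flow API, mirroring the optimizer's isinstance check."""
    if isinstance(q, ZukoStyleFlow):
        return q.sample_and_log_prob(n)
    samples = q.rsample(n)
    return samples, q.log_prob(samples)
```

Downstream loss code then only ever sees `(samples, log_probs)` pairs, regardless of the backend.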
- Fix duplicate __all__ exports in sbi/inference/__init__.py
- Remove incorrect ZukoFlowType export from posteriors/__init__.py
- Add thread-safety lock to AmortizedVIPosterior.set_x()
- Fix missing comma in VIPosterior progress bar display

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Phase 3 Steps 3.1-3.3 of VI unification:
- Add _mode tracking attribute for single_x vs amortized modes
- Add _build_zuko_flow helper method for building Zuko flows
- Update set_q to use Zuko for maf/nsf/mcf/scf flow types
- Keep Pyro flows for gaussian/gaussian_diag for backwards compat
- Add ZukoUnconditionalFlow validation in set_q
- Fix undefined transforms bug in build_zuko_unconditional_flow

The train(x_o) signature remains unchanged. DivergenceOptimizer (adapted in Phase 2) handles both Pyro and Zuko flows.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
…rence

Add amortized VI support to VIPosterior:
- New train_amortized(theta, x) method for training conditional flows q(θ|x)
- Updated sample() and log_prob() to handle both single-x and amortized modes
- Added _build_conditional_flow() helper using Zuko conditional flows
- Mode tracking with warnings when switching between modes
- Thread-safety lock for potential_fn state during ELBO computation
- Validation-based early stopping for amortized training

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Fix deepcopy/pickle of VIPosterior with _set_x_lock attribute:
  - __deepcopy__: create a new Lock instead of attempting deepcopy
  - __getstate__: pop _set_x_lock from the state dict (not picklable)
  - __setstate__: restore the lock immediately after restoring __dict__
- Update tests for Zuko flow compatibility:
  - Use sample() instead of rsample() for Zuko flows (which don't have rsample)
  - Skip the .support attribute check for Zuko flows (they don't expose it)
- Fix typo in docstring: "due not support" -> "do not support"

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
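A minimal sketch of this lock-handling pattern, using a hypothetical `PosteriorWithLock` class in place of VIPosterior:

```python
import copy
import pickle
import threading

class PosteriorWithLock:
    """Sketch of making an object that holds a threading.Lock safe to
    deepcopy and pickle (locks support neither out of the box)."""

    def __init__(self):
        self._set_x_lock = threading.Lock()
        self.data = {"trained": False}

    def __deepcopy__(self, memo):
        # Copy everything except the lock; give the copy a fresh Lock.
        cls = self.__class__
        new = cls.__new__(cls)
        memo[id(self)] = new
        for k, v in self.__dict__.items():
            if k == "_set_x_lock":
                new.__dict__[k] = threading.Lock()
            else:
                new.__dict__[k] = copy.deepcopy(v, memo)
        return new

    def __getstate__(self):
        # Drop the unpicklable lock from the pickled state.
        state = self.__dict__.copy()
        state.pop("_set_x_lock", None)
        return state

    def __setstate__(self, state):
        # Restore attributes, then recreate the lock immediately.
        self.__dict__.update(state)
        self._set_x_lock = threading.Lock()
```

Without these hooks, both `copy.deepcopy` and `pickle.dumps` raise `TypeError` on the lock.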
- Update VIPosterior.sample() to properly handle batched x in amortized mode:
  - Preserve batch dimension for multi-observation inputs
  - Squeeze singleton batch dimension to match base posterior behavior
  - Support default_x via _x_else_default_x()
- Implement sample_batched() for amortized mode (delegates to sample())
- Improve log_prob() docstring with batched x documentation
- Standardize error types: use ValueError instead of AttributeError
- Migrate all tests from AmortizedVIPosterior to VIPosterior.train_amortized():
  - Update imports and constructor calls
  - Change .train() to .train_amortized() with flow params
  - Update assertions for new mode attribute

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
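The shape convention can be sketched as a small helper. This is hypothetical code, and the ordering (x-batch placed between the sample shape and the event dimension, as in sbi's `sample_batched`) is an assumption here:

```python
def amortized_sample_shape(sample_shape, x_batch, theta_dim):
    """Illustrates the batching rule described above: a singleton x-batch
    is squeezed to match the base posterior's output shape, while
    multi-observation inputs keep their batch dimension."""
    if x_batch == 1:
        return tuple(sample_shape) + (theta_dim,)
    return tuple(sample_shape) + (x_batch, theta_dim)
```

So `sample((1000,))` with a single observation yields `(1000, theta_dim)`, matching non-amortized posteriors, while five observations yield `(1000, 5, theta_dim)`.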
Remove AmortizedVIPosterior from __all__ exports in both sbi/inference/__init__.py and sbi/inference/posteriors/__init__.py. All functionality has been migrated to the unified VIPosterior class with its train_amortized() method.

Note: The amortized_vi_posterior.py file still exists pending manual deletion approval. The vi_pyro_flows.py file is retained as it's still needed for gaussian/gaussian_diag flow types.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Thanks for the review @manuelgloeckler! As discussed offline, I made major updates to this PR to combine both single-x VI and amortized VI into one posterior class.

AI Usage: As I am experimenting with different AI coding strategies at the moment, I set up a detailed "product requirement file" summarizing our goal and then ran Claude Code in a loop (similar to the RALPH setting) that runs overnight, works on one step at a time, does a self-review, writes its progress into a file, and then starts from scratch with fresh context. Therefore, many commits are co-authored by Claude. It worked reasonably well, but not perfectly; Claude failed to fix an issue.

Summary

Unified API

Instead of a separate AmortizedVIPosterior class, we now have a single VIPosterior with two training modes:
```python
# Single-x mode (unchanged)
posterior = VIPosterior(potential_fn, prior)
posterior.train()
samples = posterior.sample((1000,))

# Amortized mode
posterior = VIPosterior(potential_fn, prior)
posterior.train_amortized(theta, x, flow_type=ZukoFlowType.NSF)
samples = posterior.sample((1000,), x=x_new)  # works for any x
```

I agree with your points on the tests:
Resolved conflicts:
- vi_posterior.py: combined docstrings for sample() Args
- vi_test.py: kept sampling_method parameterization for comprehensive testing
- vi_test.py: removed duplicate assertion for K parameter

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Summary
This PR implements amortized variational inference via a new `AmortizedVIPosterior` class, addressing #909.

Note on AI usage: I used Claude Code to help implement this. I did many iterations and careful reviewing, both myself and with a very critical Codex 5.2 reviewer.
Context
The existing `VIPosterior` trains an unconditional variational distribution `q(θ)` for a fixed observation `x_o`. This requires retraining for every new observation, which is inefficient in scenarios requiring inference across many observations.

Amortized VI addresses this by learning a conditional distribution `q(θ|x)` that generalizes across observations. Once trained on simulation data `(θ, x)`, the posterior can provide instant samples for any new `x` without retraining.

As part of this PR, we also align MAP estimation with the base posterior logic (potential-based) and keep sampling output shapes consistent across posteriors.
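To make the objective concrete: per observation x, training maximizes the ELBO E_{q(θ|x)}[log p̃(θ, x) − log q(θ|x)], averaged over simulated observations, where p̃ is the unnormalized potential. A dependency-free Monte Carlo sketch (toy 1-D Gaussians standing in for the potential and the flow; not sbi code) shows the estimate recovering −KL(q‖p):

```python
import math
import random

def log_normal(x, mu, sigma):
    """Log density of N(mu, sigma^2)."""
    return -0.5 * (((x - mu) / sigma) ** 2 + math.log(2 * math.pi * sigma**2))

def elbo_estimate(n, rng):
    """Monte Carlo ELBO with target p = N(0, 1) (playing the role of the
    unnormalized potential) and variational q = N(0.5, 1)."""
    total = 0.0
    for _ in range(n):
        theta = rng.gauss(0.5, 1.0)  # theta ~ q
        total += log_normal(theta, 0.0, 1.0) - log_normal(theta, 0.5, 1.0)
    return total / n

rng = random.Random(0)
elbo = elbo_estimate(40_000, rng)
# Analytically, ELBO = -KL(q || p) = -(0.5)^2 / 2 = -0.125 here.
```

In the amortized setting the same quantity is computed with `q(θ|x)` conditioned on each simulated `x`, and gradients flow through reparameterized samples of the conditional flow.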
Implementation
We introduce `AmortizedVIPosterior`, which trains a conditional normalizing flow `q(θ|x)` by optimizing the ELBO against a potential function from NLE/NRE.

A `ZukoFlowType` enum provides type-safe selection of flow architectures (NSF, MAF, NAF, UNAF, SOSPF, NICE, GF, NCSF, BPF).

Design Choices
Separate class vs extending VIPosterior
We created a new class rather than adding an amortized mode to `VIPosterior`, mainly because the training signatures differ (`train()` vs `train(theta, x)`) and a separate class keeps the existing `VIPosterior` unchanged.

ZukoFlowType enum scoping
The `ZukoFlowType` enum is defined in `sbi.neural_nets.factory` and reused here; it covers the flow architectures with `log_prob` support (NSF, MAF, NAF, UNAF, SOSPF, NICE, GF, NCSF, BPF).

Potential function interface
The implementation uses the existing `potential_fn.set_x()` pattern for efficiency. This makes the class non-thread-safe, which is documented in the class docstring.

Testing
We validate correctness using a linear Gaussian problem where the true posterior is analytically known. Tests verify:
- accuracy against `VIPosterior` on the same problem
- error handling (e.g. batched `x`, untrained model)

Closes #909
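For reference, the analytic ground truth that such linear-Gaussian checks compare against can be computed in closed form. A 1-D conjugate sketch (hypothetical helper, not the sbi test code):

```python
import math

def gaussian_posterior_1d(mu0, sigma0, sigma_noise, x_obs):
    """Conjugate 1-D linear-Gaussian posterior used as ground truth:
    prior theta ~ N(mu0, sigma0^2), likelihood x | theta ~ N(theta,
    sigma_noise^2). Returns posterior mean and std given observations x_obs."""
    n = len(x_obs)
    precision = 1.0 / sigma0**2 + n / sigma_noise**2
    mean = (mu0 / sigma0**2 + sum(x_obs) / sigma_noise**2) / precision
    return mean, math.sqrt(1.0 / precision)
```

Samples from the trained amortized posterior can then be compared against this mean and standard deviation for many different observations without retraining.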